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Abstract 

In  this  paper,  we  present  a  new  routing  algorithm, 
whieh  we  eall  path-finding  algorithm  (PFA).  It  drasti- 
eally  reduees  the  possibility  of  temporary  routing  loops, 
whieh  aeeounts  for  its  fast  eonvergenee  properties. 
Like  other  path-finding  algorithms,  PFA  operates  by 
speeifying  the  seeond-to-last  hop  to  eaeh  destination, 
in  addition  to  the  distanee  to  the  destination.  A  de¬ 
tailed  proof  of  eorreetness  and  eomplexity  is  presented 
elsewhere.  PFA ’s  performanee  is  eompared  quantita¬ 
tively  by  simulation  with  DUAL  (a  loop-free  routing 
algorithm)  and  an  ideal  link- state  algorithm  (ILS).  A 
number  of  parameters,  ineluding  the  length  of  the  mes¬ 
sages  and  the  number  of  steps  required  for  eonvergenee, 
are  used  in  the  eomparison.  The  simulation  results  in- 
dieate  that  PFA  eonstitutes  a  very  effieient  distanee- 
veetor  algorithm.  It  provides  about  50%  improvement 
in  performanee  eompared  to  DUAL  in  terms  of  the 
eonvergenee  time  and  the  number  of  updates  after  sin¬ 
gle  link  failures,  and  provides  eomparable  or  better  eon¬ 
vergenee  speed  and  traffie  overhead  than  ILS,  with  or¬ 
ders  of  magnitude  fewer  CPU  eyeles. 

1  Introduction 

Routing  in  today’s  computer  networks  and  Internet 
is  accomplished  by  distributed  shortest-path  routing 
algorithms.  The  distributed  Bellman-Ford  (DBF)  al¬ 
gorithm  [1]  has  been  used  in  many  well  known  routing 
protocols.  However,  DBF  takes  a  long  time  to  con¬ 
verge  after  link  failure  or  link-cost  changes  because  of 
bouncing  effect  and  counting-to-infinity  problem  [5]. 
Path-finding  algorithms  can  be  an  attractive  alter¬ 
native  to  DBF  for  distributed  routing  as  they  elimi¬ 
nate  counting-to-infinity  problem.  However,  the  path¬ 
finding  algorithms  proposed  in  the  past  incur  substan¬ 
tial  temporary  loops  in  the  paths  specified  by  the  pre¬ 
decessor  information  before  they  converge,  which  leads 
to  slower  convergence. 

This  paper  presents  a  new  algorithm,  simply  called 
path  finding  algorithm  (PFA),  that  substantially  re¬ 
duces  the  number  of  cases  in  which  routing  loops  can 
occur.  Its  performance  is  compared  with  the  perfor¬ 
mance  of  an  ideal  topology  broadcast  (or  link  state) 
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algorithm  and  DUAL,  which  has  been  shown  to  be 
a  very  efficient  distance- vector  algorithm.  The  main 
feature  of  PFA  is  the  notion  of  predeeessor  or  second- 
to-last  hop.  Using  this  information,  each  node  can  in¬ 
fer  the  path  implicit  in  a  distance-table  entry  without 
excessive  overhead.  Each  node  maintains  the  shortest- 
path  spanning  tree  reported  by  its  neighbors,  and  uses 
this  information  and  the  information  regarding  the 
cost  of  the  adjacent  links  to  generate  its  own  shortest- 
path  spanning  trees.  The  fact  that  PFA  reduces  tem¬ 
porary  looping  accounts  for  its  superior  performance 
over  DUAL  and  the  ideal  link  state  algorithm. 

The  rest  of  the  paper  is  organized  as  follows.  Sec¬ 
tion  2  presents  the  network  model  assumed  in  PFA 
and  introduces  the  notation  used  throughout  this  pa¬ 
per.  Section  3  provides  the  description  of  PFA  along 
with  an  example  illustrating  key  aspects  of  its  oper¬ 
ation.  Section  4  presents  simulation  results  showing 
that  PFA  constitutes  an  attractive  alternative  for  the 
implementation  of  routing  protocols  that  rely  on  the 
exchange  of  vectors  of  distances.  Finally,  Section  5 
presents  our  conclusions. 

2  Network  Model 

We  have  modeled  a  computer  network  as  an  undi¬ 
rected  graph  represented  as  G(V,E),  where  V  is  the 
set  of  nodes  and  E  is  the  set  of  links  (or  edges)  con¬ 
necting  the  nodes.  Each  node  represents  a  router  and 
is  a  computing  unit  involving  a  processor,  local  mem¬ 
ory  and  input  and  output  queues  with  unlimited  ca¬ 
pacity.  A  functional  bidirectional  link  connecting  the 
nodes  is  assigned  a  positive  weight  in  each  direction. 
A  link  is  assumed  to  exist  in  both  the  directions  at 
the  same  time.  All  messages  received  (transmitted) 
by  a  node  are  put  in  the  input  (output)  queue  on  a 
first-come-first-serve  basis  and  are  processed  in  that 
order.  An  underlying  protocol  assumes  that 

•  Every  node  knows  its  neighbors.  This  implies 
that,  within  a  finite  time  a  node  detects  the  exis¬ 
tence  of  a  new  neighbor  or  the  loss  of  connectivity 
with  a  neighbor. 

•  All  packets  transmitted  over  an  operational  link 
are  received  correctly  and  in  the  proper  sequence 
within  a  finite  time. 

•  All  update  messages,  changes  in  the  link-cost,  link 
failures  and  link  recoveries  are  processed  one  at  a 
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update=0;  HTEMP^ 
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begin 
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respond  as  if  a  single  entry 
is  received  on  link  (i,  fc) 
copy  whole  routing  table  into  L>T EM P'’ 
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Function  ln_Path(Node,Neighbor,Oest,  neigh) 
begin 

P  ■<-  : 

^  ^Desi, neigh’ 

if  (p  =  Node)  then  return  (false) ; 
else  if  (p  =  Neighbor)  then  return(true); 
else  In_Path(Node,Neighbor  ,p,neigh) ; 


if  {RTEMP^  ^  4>)  then 
for  each  neighbor  b  do 

for  each  triplet  t  =  (j,  pi)  in  RPEMP^  do 

if  b  is  not  in  the  path  from  i  to  j 

then  ETEMP^’^  i-  ETEMP^’^  U  t; 


nd  ETEMP 


i  ,b 


cedure  RT_Update 

initialize  all  destinations  to  be  unmarked; 
for  any  unmarked  destination  j  do 
begin 

if  there  is  no  finite  distance  in  row  j 
then  mark  j  as  undetermined; 


iighbo 


jb 


re  Ch«nge(.,  fc,  d.*) 
u  changes  value  do 
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is  not  minimum  of  row  c)) 
then  mark  each  node  in  TV  as  undetermined 
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edure  Failure(t,  fc) 
link  (i,  fails  do 


mark  each  ; 
E'^.  ■«-  E'^.,  ; 


delete  column  k  in  iD, 
if  there  is  a  destinatic 
then  Call  RT_Update 
Go  to  Step  (4); 


i  TV  as  determin 
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copy  the  routing  vector  to  RTEMP^  if  the  distance  i 
predecessor  has  changed 


Figure  1:  PFA  Specification 


time  in  the  order  in  which  they  occur. 

When  a  link  fails,  the  corresponding  distance  entry 
in  a  node’s  distance  and  routing  tables  are  marked  as 
infinity.  A  node  failure  is  modeled  as  all  links  incident 
on  that  node  failing  at  the  same  time.  A  change  in 
the  operational  status  of  a  link  or  a  node  is  assumed 
to  be  notified  to  its  neighboring  nodes  within  a  finite 
time.  These  services  are  assumed  to  be  reliable  and 
are  provided  by  lower  level  protocols. 

Throughout  the  paper  the  following  notations  is  used: 

j:  Destination  node  identifier  j  ^  N 

b,k:  Neighbor  nodes 

Djj.-  Distance  entry  at  node  i  to  destination 

j  through  neighbor  k  in  the  distance  table 
Dj :  Distance  entry  at  node  i  to  destination 

j  in  the  routing  table 

Pjf.:  Predecessor  from  i  to  j  through  k  in  the 

distance  table 

pj :  Predecessor  from  i  to  j  in  the  routing  table 

Sj :  Successor  to  destination  j  from  a  given 

node  in  the  routing  table 
dik'.  Link  cost  from  i  to  neighbor  k 
Ni :  Set  of  neighbors  of  i 


3  PFA  Description 

Figure  1  defines  PFA.  PFA  consists  of  procedures 
used  for  topology  changes  (Failure,  Recover,  and 
Change),  path  traversal  (In_Path),  updating  the  rout¬ 
ing  table  (RT_Update)  and  a  procedure  to  process  up¬ 
date  messages  (Update).  The  main  feature  of  PFA  is 
the  notion  of  second-to-last  hop  or  predecessor.  Using 
predecessor  information,  each  node  can  infer  the  path 
implicit  in  a  distance  entry  without  excessive  over¬ 
head. 

Each  node  maintains  a  distance  table,  a  routing  ta¬ 
ble  and  a  link-cost  table.  The  distance  table  is  a  matrix 
containing  the  distance  and  predecessor  entries  (path 
information)  for  all  the  destinations  through  all  its 
neighbors.  The  routing  table  is  a  column  vector  of 
minimum  distance  to  each  destination  and  its  corre¬ 
sponding  predecessor  and  successor  information.  The 
link-cost  table  lists  the  cost  of  each  link  adjacent  to 
the  node;  the  cost  of  a  failed  link  is  considered  to  be 
infinity.  An  update  message  contains  the  source  and 
the  destination  node  identifiers,  and  the  distance  and 
predecessor  for  one  or  more  destinations. 

When  a  node  i  receives  an  update  message  from 
its  neighbor  k  regarding  destination  j,  the  distance 
and  the  predecessor  entries  in  the  distance  table  are 
updated  (Step  1).  A  unique  feature  of  PFA  is  that, 
node  i  also  determines  if  the  path  to  destination  j 


(OJ) 


(infinity,-) 


(c) 


Figure  2:  Example  of  the  algorithm’s  operation 


through  any  of  its  other  neighbors  {b  £  N,  |6  ^  k} 
includes  node  k.  If  the  path  implied  by  the  predeces¬ 
sor  information  reported  by  node  b  includes  node  k, 
then  the  distance  entry  of  that  path  is  also  updated 
as  Djj,  =  -H  Dj  and  the  predecessor  is  updated 
as  ptj,  =  pj.  Thus,  a  node  can  determine  whether  or 
not  an  update  received  from  k  affects  its  other  dis¬ 
tance  and  routing  table  entries.  Before  updating  the 
routing  table,  node  i  checks  for  all  simple  paths  to  j 
reported  by  its  neighbors,  and  the  shortest  of  these 
simple  paths  becomes  the  path  from  i  to  j  (Procedure 
RT .Update).  This  implies  that  at  each  stage,  node 
i  checks  for  the  simple  paths  and  avoids  loops.  Link 
or  node  failures,  recoveries  and  link-cost  changes  are 
handled  similarly  (Steps  (5),  (6)  and  (7)). 

In  contrast  to  PEA,  which  makes  a  node  i  check 
the  consistency  of  predecessor  information  reported 
by  all  its  neighbors  each  time  it  processes  an  event 
involving  a  neighbor  k,  all  previous  path-finding  algo¬ 
rithms  [2,  3,  4]  check  the  consistency  of  the  predecessor 
only  for  the  neighbor  associated  with  the  input  event. 
This  unique  feature  of  PEA  accounts  for  its  fast  con¬ 
vergence  after  a  single  resource  failure  or  recovery  as 
it  eliminates  more  temporary  looping  situations  than 
previous  path- finding  algorithms. 

The  following  example  illustrates  the  working  of  the 
algorithm.  Consider  the  four  node  network  shown  in 
Eigure  2(a).  Let  PEA  be  used  in  this  network.  All 
links  and  nodes  are  assumed  to  have  the  same  propa¬ 
gation  delays.  Link-costs  are  as  indicated  in  the  figure 
and  are  assumed  to  be  the  same  in  both  the  directions. 
Node  i  is  the  source,  j  is  the  destination  and  node  k 
and  b  are  the  neighbors  of  node  i.  The  arrows  next  to 
links  indicate  the  direction  of  updates  messages  and 
the  label  in  parentheses  gives  the  distance  and  the 
predecessor  to  destination  j.  The  figure  focuses  on 
update  messages  to  destination  j  only. 


When  link  (y,  k)  fails,  nodes  j  and  k  send  update 
messages  to  their  neighboring  nodes  as  shown  in  Eig¬ 
ure  2(b).  In  this  example,  node  k  is  forced  to  report 
an  infinite  distance  to  j  as  nodes  b  and  i  have  reported 
node  k  as  part  of  their  path  to  destination  j.  Node 
b  processes  node  k’s  update  and  selects  link  {b,j)  to 
destination  j.  This  is  because  of  step(2)  of  the  algo¬ 
rithm  which  forces  node  b  to  purge  any  path  to  node 
j  involving  node  k.  Also,  when  i  gets  node  k^s  update 
message,  i  updates  its  distance  table  entry  through 
neighbor  k  and  checks  for  the  possible  paths  to  desti¬ 
nation  j  through  any  other  neighboring  nodes.  Thus, 
a  node  examines  the  available  paths  through  its  other 
neighboring  nodes  and  updates  the  distance  and  the 
routing  table  entries  accordingly.  This  results  in  the 
selection  of  the  link  (i,i)  to  the  destination  j  (Eig¬ 
ure  2(c)).  When  node  i  receives  6’s  update  reporting 
an  infinite  distance,  node  i  does  not  have  to  update 
its  routing  table  as  it  already  has  correct  path  infor¬ 
mation  (Eigure  2(d)).  Similarly,  updates  sent  by  node 
k  reporting  a  distance  of  11  to  destination  j  will  not 
affect  the  path  information  of  nodes  i  and  6.  This  il¬ 
lustrates  how  step (2)  of  PEA  helps  in  the  reduction  of 
the  formation  of  temporary  loops  in  the  explicit  paths. 

The  proofs  of  correctness,  convergence  and  com¬ 
plexity  of  PEA  are  given  elsewhere  [6].  The  worst- 
case  complexity  of  PEA  has  been  found  to  be  0(h)  for 
single  recovery /failure,  h  being  the  height  of  the  tree. 

4  Simulation  Results 

To  gain  insight  into  the  average-case  performance 
of  the  algorithms,  we  have  developed  simulations  us¬ 
ing  an  actor-based,  discrete-event  simulation  language 
called  Drama  [7],  together  with  a  network  simulation 
library.  Link  failures  and  recoveries  are  simulated  by 
sending  link  status  message  to  the  nodes  at  the  end 
points  of  the  appropriate  links.  Node  failures  can  be 
treated  as  all  links  connecting  to  that  node  going  down 


Table  1:  Simulation  Results  for  Los-Nettos 


Parameter 

PFA 

DUAL 

lUS  1 

mean 

sdev 

mean 

sdev 

mean 

sdev 

1  Link-J 

'I'ailure  | 

Event  Count 

?o77 

TO 

TO 

5gi^ 

o 

Packet  Count 

13.5 

6.01 

32.6 

11.8 

27.0 

5.8 

Duration 

2.86 

0.74 

6.7 

1.33 

4.2 

0.88 

Operation  Count 

62.4 

18.03 

69.9 

18.6 

724.1 

27.3 

1  Link-Recovery  | 

Event  Count 

go 

15.5 

07 

TO 

30 

TM 

Packet  Count 

18.0 

5.04 

17.0 

7.25 

31.9 

1.86 

Duration 

2.93 

0.46 

3.71 

0.88 

3.86 

0.46 

Operation  Count 

109.6 

24.6 

65.7 

7.45 

944.0 

45.8 

1  iNode- 

Failure  | 

Event  Count 

135.6 

7g:o 

TO 

50 

WTE 

g:g 

Packet  Count 

39.8 

17.5 

45.5 

3.26 

26.7 

7.19 

Duration 

5.82 

2.85 

6.91 

0.99 

4.09 

0.51 

Operation  Count 

195.0 

112.2 

123.9 

50.2 

702.9 

204.3 

1  JNode-H 

.ecovery  | 

Event  Count 

250 

rT7:g 

go 

Oo 

oO 

13:4 

Packet  Count 

30.4 

10.3 

41.0 

12.4 

51.1 

10.8 

Duration 

3.18 

0.38 

4.7 

0.44 

4.4 

0.5 

Operation  Count 

274.9 

136.4 

145.2 

66.5 

1698.8 

478.4 

at  the  same  time  and  the  link  cost  changes  can  be 
treated  as  a  link  failing  and  recovering  with  a  new  link 
cost.  All  simulations  are  done  assuming  unit  propa¬ 
gation  time  and  zero  packet  processing  time  at  each 
node.  If  a  link  fails  when  packets  are  in  transit,  the 
packets  are  dropped. 

For  simulating  the  routing  algorithm,  a  node  re¬ 
ceives  a  packet  and  responds  to  it  by  running  the  rout¬ 
ing  algorithm  and  queueing  the  outgoing  packets  and 
processing  the  updates  one  at  a  time  in  the  order  in 
which  they  arrive.  In  our  simulations,  packet  process¬ 
ing  time  is  kept  as  zero.  Drama’s  internals  ensure  that 
all  the  packets  at  a  given  time  are  processed  before  new 
updates  are  generated. 

The  performance  of  PFA  has  been  compared  with 
DUAL  and  an  ideal  link  state  (ILS)  algorithm  which 
uses  Dijkstra’s  shortest-path  algorithm  [1]  at  each 
node.  We  have  instrumented  our  simulations  using 
the  following  four  quantities: 

Events:  total  number  of  updates 

Packets:  total  number  of  packets  transmitted  over 
the  network 

Duration:  total  time  elapsed  for  the  algorithm  to 
converge 

Operations:  total  number  of  operations  performed 
by  the  algorithm 

Counters  are  used  to  instrument  these  quantities. 
These  counters  can  be  reset  at  various  points  and  the 


values  of  these  counters  are  printed  when  the  algo¬ 
rithm  converges  (that  is,  when  the  event  queue  emp¬ 
ties). 

Simulation  results  are  obtained  for  all  single  failures 
and  recoveries  of  links  and  nodes.  The  routing  algo¬ 
rithm  was  allowed  to  converge  after  each  such  change. 
The  simulations  were  run  on  several  network  topolo¬ 
gies  after  a  series  of  tests  on  smaller  topologies  for 
debugging  purposes.  The  results  for  link  and  node, 
failure  and  recovery  for  Los-Nettos  are  presented  in 
Table  1.  The  results  for  other  network  topologies  can 
be  found  elsewhere  [6]. 

PFA  has  a  better  overall  average  performance  than 
ILS  after  the  recovery  of  a  single  node  or  link.  The  per¬ 
formance  of  PFA  is  comparable  to  ILS  after  the  failure 
of  a  single  node  or  a  link.  This  is  a  remarkable  im¬ 
provement  over  DUAL  which  requires  approximately 
twice  the  number  of  steps  to  converge  than  ILS  after 
failures.  For  link  failures,  the  operation  count  of  PFA 
is  almost  the  same  as  that  of  DUAL  with  the  duration 
to  converge  being  half  that  of  DUAL.  The  number  of 
packets  (messages)  exchanged  among  nodes  is  almost 
50%  less  than  that  of  DUAL.  However,  the  event  count 
and  the  operation  count  is  about  2  to  3  times  higher 
than  DUAL;  the  operations  count  of  PFA  is  up  to  an 
order  of  magnitude  less  than  that  of  ILS. 

5  Conclusion 

In  this  paper,  we  have  presented  a  new  path-finding 
algorithm,  PFA,  that  reduces  the  occurrence  of  tem¬ 
porary  routing  loops  without  the  need  for  internodal 
synchronization  mechanism  or  the  exchange  of  com¬ 
plete  path  information.  The  performance  of  the  al- 


gorithm  is  compared  with  DUAL  and  an  ideal  link- 
state  algorithm  quantitatively  by  simulation.  In  terms 
of  convergence  time  after  link  failures,  PFA  provides 
about  50%  improvement  in  performance  as  compared 
to  DUAL  by  exchanging  lesser  number  of  messages 
than  DUAL  and  its  performance  is  comparable  to  that 
of  ILS.  The  worst-case  complexity  of  the  algorithm  has 
been  shown  (elsewhere)  to  be  0(h),  where,  h  is  the 
height  of  the  tree. 
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